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Title: A method for in vivo production of a mutant, library in 
cells 

5 FIELD OF THE INVENTION 

The present invention relates to methods for in vivo production 
of libraries of polypeptide variants, the screening of these 
variants and selection of those exhibiting desired properties, 
o The invention furthermore relates to methods for producing the 
desired polypeptide variants. 

BACKGROUND OF THE INVENTION 

5 An increasing number of polypeptides , including enzymes and non- 
enzymatic proteins, are being produced industrially, for use. in 
various industries, household, food/feed, cosmetics, medicine 
etc. One of the major sources for these proteins is and have been 
microorganism found in nature. 

o 

The classical approach for finding polypeptides with new and 
special properties, have been to screen wild type organisms 
present in nature. This has been a very successful way of 
procuring polypeptides to be used in such diverse areas as the 
s above mentioned applications. 

However, often it has not been possible to produce such 
polypeptides in sufficient amounts because the quantities 
produced in the natural host systems were too minute to allow a 
o production, and even if the cost was no problem, difficulties 
could be encountered in providing sufficient amounts in relation 
to the demand (e.g. human growth hormone) . 

Such problems have to a large degree been overcome by the advent 
5 of recombinant techniques for the production of polypeptides. In 
this art polypeptides are produced by the use of biological 
systems. Genes encoding certain polypeptides are cloned and 
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transferred into cells that will produce the polypeptides in 
quantities much larger than those, wherein they are produced in 
the original organism. Over the latest twenty years a large 
number of methods for the production of polypeptides according to 
5 such techniques have been developed. 

Often, proteins from natural sources do not meet the requirements 
for certain applications, and it will be necessary to modify 
existing proteins towards certain activities or biophysical 
10 properties. 

It is possible to generate new variants of a protein by classical 
mutagenesis of the microorganism using radiation (X-ray and UV) 
or chemical mutagens. However, since this approach is a very 
is labour and time consuming process, in the same last two decades 
researchers have been developing improvements on existing 
oolvoeptides by using more specific and selective recombinant 
techniques, such as protein and genetic engineering for creating 
artificial diversity. 

20 

Bas=d uoon considerations using knowledge of the structure- 
function relationships and general protein chemistry, researchers 
have come a long way in designing' polypeptide variants exhibiting 

ir.crovement s in various properties , 

25 

.^iisea -hat 'the various interactions 
^ov.'^ver, it nas dis^ ~ - ~ - 

into which oolvpeptides take part, are so complex that rational 
desicn according to such knowledge has serious limitations, and 
in recent years methods employing random mutagenesis followed by 

c ^lor^on f-n- larqe numbers of variants 

30 screening of or seiecuxon j.-o... . _-y 

oroduced therefrom has gained interest. 



• • „ - -.-w-™, 0 f mutants is qenerated 'for 
For this purpose a microDia^ .-^-c^y °~ lllULdUL:j ^ * 

• ^ _ * r -^i nq to determine variants 
subsequent expression an- 

35 possessing the desired properties. 
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Over the- years many both in vitro and in vivo DNA mutagenesis 
techniques for creating high numbers of different variants of 
polypeptides have been developed. 

s Considering the fact that a typical naturally occurring 
polypeptide consists of between 100 and 1000 amino acids, and 
each may be varied in 20 ways (only to stay within the naturally 
occurring amino acids) , the number of possible variants for a 
specific polypeptide is enormous. Since the main parameter that 

d defines or measures the usefulness of a microbial collection or 
library used to identify improved variants of polypeptide is the 
number of different variants, N, which is comprised in the 
collection, a need for large libraries has emerged. 

5 Especially in cases when a powerful selection system is 
available, the limiting factor for the identification of the 
desired polypeptide is the size of "he library. 

In in vitro systems the practical, state of art, limit for N is 
) about 10 8 . This is mainly due to inefficiency of transf ormat ion 
(introduction of DNA into the cell) of the manipulated DNA into 
the host organism. This number varies a lot from organism to 
organism: in the presently best case, E . coli, the usual 
efficiency of transformation cf in vitro manipulated DNA, e.g. a 
; ligation of DNA fragments or chemical treatment of DNA, leads at 
the most to library sizes up to 10 e bacteria (Greg Winter, 
Current methods in Inir^jnology 5: 253-255, 1993). Very few 
examples of libraries of this size have been reported.' 

D In vitro library constructions in other prokaryotes, such as 
Bacillus sp. , Streptococcus sp. or Staphylococcus sp. will for 
practical reasons be orders of magnitude below this number. 

Considering eukaryotic hosts such as Saccharomyces cerevisiae or 
5 various Aspergillus sp., an even lower number of transf ormants 
can be expected from in vitro manipulated DNA. 
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A special case of a large library has been reported based on in 
vivo recombination between libraries of antibody light and heavy 
chains based on a speciallv designed system useful for that 
5 particular case (Griffiths, A.D. et al . , 1994, EMBO J. 14: 3245- 

3260) . 

A number of methods are available to generate variants of a 
polypeptide in microorganisms in vivo, ranging from very simple, 

10 such as treating cells with chemical or physical mutagens, to 
rather complex, relying on cells that contain an error-prone DNA 
polymerase but lack the mismatch repair system which corrects the 
errors (Stratagene, XLl-red (mutS, xnutD, wutT) Catalog #200129) . 
But these techniques have a major drawback as the mutagenesis is 

is not targeted to a specific part of the genome (coding for the 
polypeptide of interest) and high frequencies of mutations are 
generated also in essential genes for the cell as well as in the 
target gene, resulting in rr.assive ceil death, together with a 
high number of cells, where the mutations do not influence the 

20 polypeptide of interest. Such "noise" will limit the accumulation 
of mutations in the target region. 

It is therefore the object of the invention to provide an in vivo 
target region- speci f i c mutagenesis procedure in order to produce 
25 verv large numbers, of polypeptide variants. 

A second object of the invention relates to the screening or 
selection of variants with the desired properties, both by 
existing and future technologies. 

30 



SlfrlMARY OF THE INVENTION 

The present invention therefore relates to a method for in vivo 
35 production of a library in cells comprising a multitude of 
mutated genetic elements, wherein an error-prone polymerase is 
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used in * each ancestral cell to replicate all or a part of a 
genetic element comprising 

i) an origin of replication from which replication is 
initiated, 

ii) optionally a genetic marker, e.g. a gene conferring 
resistance towards an antibiotic, 

iii) a gene encoding the polypeptide of interest, 
independently of the host chromosomal replication machinery. 

The invention furthermore relates to a method for the generation 
of a DNA sequence encoding a desired variant of a polypeptide of 
interest, wherein 

i) a mutant library is produced by the above method, 

ii) said library is cultivated under conditions conducive for, 
the .expression of said gene of interest to produce 
polypeptide variants , 

iii) said variant polypeptides are screened or selected for a 
desired property, and hosts producing such desired 
variants identifier" and isolated, 

iv) said genetic element in said hosts is sequenced to 
elucidate the DNA sequence of the mutant gene encoding a 
desired variant. 

and a method for the determination of the DNA sequence encoding a 
desired variant of a polypeptide of interest, wherein 

(i) a mutant library is produced by the above method, 

(ii) said library is cultivated under conditions conducive for 
the expression of said gene cf interest to produce variant 
polypeptides , 

(iii) said variant polypeptides are screened or selected for a 
desired property, and hosts producing desired variants 
identified and isolated, • 

(iv) said genetic element in said hosts is sequenced to 
elucidate the DNA sequence of the mutant gene encoding a 
desired variant. 
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The screening of the library or the selection of the variants 
depends on the specific polypeptide and which, properties thereof 
it is desired to improve and/or retain. It is therefore necessary 
to set up a screening protocol for each case. Such protocols 
5 involving a number of assays are described in the literature 
(Clackson et al., Nature 352:624-628, 1991, Bryan, P et al. t 
Proteins 1:326-334, 1986). 

An elegant approach to the combination of the generation of 
10 diversity and the selection of variants with the desired 
properties would be a combination of the in vivo method of the 
invention for generating the diversity with a phage display 
system (Greg Winter, Supra) . 

is A specific example of a polypeptide of interest is the alkaline 
proteases used in the detergent industry for the removal of 
proteinaceous stains from fabric. In that case the screening may 
be performed in actual detergent compositions to investigate 
properties such as thermal stability, oxidation stability, 

20 storage stability, substrate specificity and- affinity, stability 
to non-aqueous solvents, pH profile, ionic strength dependence, 
catalytic efficiency, and wash performance . 

Furthermore the invention relates to a process for the production 
25 of a desired polypeptide variant, wherein 

(i) a DNA sequence encoding a polypeptide of interest that has 
been determined according to the method above is intro- 
duced into a suitable host in a manner whereby it can be 
expressed in said host, 
30 (ii) said host is cultivated under conditions conducive to the 
expression of said DNA sequence, and 
(iii) said polypeptide variant is recovered. 



Methods for the introduction of the DNA sequence selected into 
35 suitable host systems are described in, (Sambrook et al . 
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Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Lab., 
Cold Spring Harbor, NY.). 

It is" also within the abilities of the skilled person to select 
5 suitable growth media and other conditions for the host system 
selected that are conducive for the expression of the polypeptide 
variant od interest. Guidance hereto may f. ex. be found in 
(Sambrook et al . , supra). 

o Also for the recovery of the polypeptide a large number of 
methods are available for the separation and purification of 
proteins, e.g. in (Scopes, R.K., protein Purification (1987), 
Springe r-Ver lag) 

5 Lastly, the invention relates to the polypeptides produced by the> 
above method. 



DETAILED DESCRIPTION OF THE INVENTION 

0 

The invention co-prises a method to construct in vivo libraries 
of variants in a gene of interest. The method involves the use of 
a genetic element, such as a bacteriophage or a plasmid that is 
able to replicate independently of the host chromosomal 

5 replication system. . By the use of the possibility to separate the 
replication of the host chromosome and 'the replication of the 
genetic element (phage/plasmid) , it is possible by modifications 
of one replication system to selectively introduce mutations in 
the genetic element (phage/plasmid) keeping the chromosome of the 

o host intact. This means that the generation of variation in the 
gene of interest does not compromise the viability of the host. 

DNA replication is a highly accurate process, and the 
misincorporat ion rate of the chromosomal replication in E. coli 
5 has been estimated to be in the order of 10" 10 pr. base pr. round 
of replication. The base pairing carried out during DNA 
replication leads to a preference for the polymerase to 
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iacorporate the correct base at a certain position, which 
accounts for approximately 10 s of the overall replication 
fidelity. If an incorrect base has been incorporated, the 
polymerase will stall and a 3-5' exonuclease will often remove 
s the 3- reincorporated base. This part denoted proof-reading 
accounts for approximately 10' of the overall replication 
fidelity. The repair system of the cell accounts for the last 10 J 
of the fidelity rate. 

10 Replication of the chromosome in E. coli is for the most part 
carried out by DNA polymerase III holoenzyme, which is a multi- 
protein complex containing 10 different polypeptides including a 

f-,i„v,=. cnh niir doIC aene) and a 3' -5' exonuclease 
polymerase (alpha sun-unic, pjit y_n=< 

(dnaO gene) . 

^ A further polymerase, DNA polymerase I (DNA pol I, polA gene), 

contains three different activities. viz. a DNA polymerase 

activity, a 3' -5' exonuclease activity, and a 5-3' exonuclease 

. . > ._ f--- -ra- •- s on° single polypeptide. This 

activity despite tne roc ur.a^ -s> <->• — SJ -' a - ^ 

-i ; -i cell Besides DNA reDair, 

20 polymerase has several rur.c,.o.:= x.. u..- ceii. 

DNA ool I is also needed for the chromosomal DNA replication, as 
<t is involved in the assembly of DNA fragments during synthesis 
of the lagging DNA strar.d. However, it replicates only a very 

-ir.or oortion of the cencrr.e. 



This polymerase is furthermore involved in initiation of DNA 
-eo^ cation of certain classes of plasmids, e.g. ColEI origin of 
reoUcation-based plasmids such as P 3R322 in Escherichia coli and 
Gram-negative bacteria or pAMSl like plasmids in Gram-positive 
30 bacteria. Such plasmids may be able to replicate completely 
through the activity of DNA polymerase I without DNA polymerase 

;r, ,-m V3 f 0 rn e.cr. if this enzyme is 
III being present m a.ave ~o.m, e.y- 

dysfunctional due to genetic causes, e.g. temperature sensitive 
variants at a non-permissive temperature), or under conditions 
35 where only limited amounts of DNA Polymerase III are present m 
the cell. 
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It is well known that certain mutations lead to a decrease (or an 
increase) in the fidelity of DNA replication. These mutations 
have been mapped to reside mainly in polymerases, exonucleases or 
s in elements of the repair system. Unfortunately, most of these 
mutations alter/impair the fidelity rate for the complete genome 
present, and such non- targeted mutations are not desirable. 

One example of such a mutation could be an inactivation of the 
10 3' -5* exonuclease activity of DNA pol I. 

However, according to the invention use is being made of the fact 
that some elements in the replication system may be temporarily 
"switched" off, fully or partially, thereby stopping or greatly 
is slowing down the replication of the genome, while replication of 
certain genetic elements as defined herein is continued. 

An E. col i strain containing a temperature sensitive DNA pol III 
(i.e. the polymerase a-sub-unit :r another temperature sensitive 

20 sub-unit - that render the holoenzyme conditionally non- 
functional), or a function required for initiation of chromosomal 
replication, such as DnaA, an error prone DNA pol I and a colEI 
based plasmid containing a gene of interest, is an example of a 
genetic system according tc the" invention designed to specif i- 

25 cally introduce mutations in the plasmid (and the gens of 
interest) . 

In st*i'h a system raising the temperature to a non-permissive 
value will have the effect that DNA pol III cer.ses to function 
30 fully, while the error prone DNA pol I will retain its function 
and replicates the plasmid with reduced fidelity"" resulting in 
mutated copies of the plasmid. 

Since the generation of mutations is random, each cell will 
35 generate unique mutations and upon lowering the temperature, the 
temperature sensitive function will become active again, and 
normal replication of the cells continue. 
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A variation- would be an E. coli strain with temperature sensitive 
alleles of polIII and poll and an inducible expression (by a ts 
repressor (temperature-sensitive) or by chemical induction) of 
the error-prone polymerase. At a restrictive temperature and the 
presence of the inducer mutations will accumulate in the genetic 
element. At permissive temperature and the absence of the 
inducer, the complete systems functions as the wild type cell. 

Accordingly the invention in its first aspect relates to a method 
for in vivo production of a mutant library in cells comprising a 
multitude of mutated genetic elements, wherein an error-prone 
polymerase is used in each ancestral cell to replicate all or a 
part of a genetic element comprising 

i} an origin of replication from which replication is 

initiated, 

ii) optionally a genetic marker, e.g. a gene conferrinc 

resistance towards ar. antibiotic, 

iii) a gene encoding the polypeptide of interest, 
independently of the host chromosomal replication machinery. 

The invention consequently comprises a method for in vivo 

production of a library ir. cells comprising a multitude of 

mutated genetic elements comprising 

A) providing a cen r.avir.g 

i) an error-prone polymerase that independently of the 

chromosomal replication machinery of said cell will 
reolicate ail or a part of a genetic element 
comprising 

3 a) an origin cf replication from which replication 

is initiated, 

b) optionally a genetic marker, e.g. a gene 
conferrinc resistance towards an antibiotic, 

c) a aene encoding the polypeptide of interest, 
e and 
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ii) a chromosomal replication machinery that can be 
reversibly induced to be substantially non- 
functional , 

B) ^growing such a cell under conditions conducive to its 
5 replication to obtain a multitude of ancestral cells, 

C) reversibly inducing said chromosomal replication machinery 
in said ancestral cells to be substantially non-functional 
for a period of time sufficient to allow for the 
replication of said genetic element by said error-prone 

io polymerase to generate mutations in said genetic element, 

D) reversibly inducing said chromosomal replication machinery 
in such mutated cells to be substantially functional, and 

E) growing such mutated cells "under conditions conducive to 
their replication. 

15 

In this context the expression "mutant library" means a set -of 
cells, bacteria or phages (typically 10 s to 10 n cells or phages) 
that differs with respect to one particular gene encoding a 
polypeptide of interest. Typically one would like to introduce 
20 one or more different amino acid alterations in this particular 
polypeptide in each member of the library. 

In this context the expression "error-prone polymerase" means a 
polymerase that during DNA replication will incorporate mistakes 
25 (one of the wrong nucleotides in a given position or cause a 
deletion or an insertion of one or several nucleotides) with 
higher frequency than the polymerase normally used for this 
purpose (e.g. E. coli DNA pol I, Bacillus subtilis DNA pol I, T4 
DNA polymerase, T7 DNA polymerase) . 

30 

The expression "ancestral cell" here means such cells wherein no 
mutations have been introduced. In some embodiments of the 
invention the mutation cycle may be reiterated, and in that case, 
such cells that were initially mutated become ancestral cells for 
35 the second mutation cycle, etc. 
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In this context the expression "host chromosomal replication 
machinery" means the DNA polymerase or DNA polymerase holoenzyme 
that is mainly responsible for the replication of the host 
chromosome, e.g. DNA polymerase III in E. coli. 

5 

In this context the expression "genetic element" means a small 
(from 1 or 2 kilo bases to 100 kilo bases) entity consisting of 
RNA or DNA, that is able to replicate independently, i.e. it 
contains an origin of replication. The genetic element would 
o typically be a bacteriophage, a phagemid, or a plasmid. The 
genetic element must also according to the invention comprise a 
gene encoding the polypeptide of- interest, and it may further 
comprise a genetic marker, e.g. a gene conferring resistance 
towards an antibiotic. 

5 

A virus, a retrovirus, or a transposes that is able to replicate 
independently of the host replication machinery, e.g. 
retrotransposons could also be used as the "genetic element". 

o Kim and Loeb (1995, PNAS 92: 634-533) have demonstrated that HIV 
reverse transcriptase .{HIV-RT) is able to complement E. coli DNA 
pol I with respect to chromosomal d::a replication and initiation 
of plasmid DNA replicat icr. . The reincorporation rate of HIV-RT 
(and related retroviral reverse transcriptases) is several orders 
5 of magnitude higher than the rate of DNA pol I, i.e. 10° to 1<T* 
reincorporations pr. base pr . round of replication. The use of 
such a polymerase in stead of a mutated error prone E. coli DNA 
pol I in an embodiment of the invention would significantly 
increase the frequency of replication errors in the system 
so described above. 

In a further embodiment of the invention the mutation cycle 
described above can be reiterated, i.e. the mutagenic polymerase 
switched on and off several times, thereby generating even more 
35 mutants. Such a step could furthermore help the segregation of 
Dlasmids if a multicopy plasmid is used as the genetic element. 
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In certain genetic elements one can envision that only the part 
of the genetic element located in the vicinity of the origin of 
replication is replicated by the error-prone polymerase. In such 
5 cases, the gene of interest should be situated within this 
region . 

In a specific embodiment of the invention the genetic element is 
a phage, wherein the gene encoding the polypeptide of interest is 

10 positioned at a locus where the polypeptide upon expression is 
displayed from the surface of the phage, whereby a screening can 
be performed directly (see Greg Winther, supra) . To ensure the 
correspondence between DNA sequence of the phage and the protein , 
displayed the primary phage stock should be passed through wild 

15 type E. coli, at low multiplicity of infection, prior to 
selection or screening. 

To further increase the frequency cf mutations, the method of the..' 
invention comprises embodiments v.- he re the method is used in 
20 conjunction with a repair deficient host, e.g. mutL, mutS, /nuttf, 
or a combination of mutator genome types. 

In this context the expression "repair deficient host" means a 
cell containing one cr more alterations in genes encoding 

:s proteins known to be directly or indirectly involved in the DNA 
repair. The result of such mutations is that a higher frequency 
of introduced mutations (by the polymerases, chemicals, X-ray, UV 
light, etc.) will not be repaired and will be "permanently" 
incorporated in the genome, the so called mutator phenotype. 

30 Examples of such genes are mutL, mutS, iuutH, mutT. 

As the genetic element one ecu Id as indicated above use a 
phagemid in stead of a plasmid in order to couple the variant 
generation to a display system, e.g. M13, fl, fd. 

35 

In this context the expression "phagemid" means a plasmid that 
besides its plasmid origin of replication contains a phage origin 
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of replication, phagemids are dependent of the conditions able to 
replicate as a plasmid or as a phage (upon infection with a 
helper-phage) . 

5 A phagemid based system would involve the construction of a 
phagemid containing: 

1) a plasmid origin of replication, e.g. ColEI 

2) a M13 phage origin of replication 

3) a chimeric gene consisting of the gene of interest fused 
io to the gene encoding GUI protein. (0rum, P. et al . , Nucl . 

Acid. Res. 21: 4491-4498, 1993). 

The first step would be the generation of diversity by 
growing/maintaining an E. coli strain transformed with this 
15 phagemid as' described above. The second step would be the 
infection with the helper phage in order to create single 
stranded phagemid that will be packed into phage particles. The 
phages displaying the variant proteins can then be subjected to a 
selection procedure. 

20 

Also, certain bacteriophages such as T4 or T7 in Eschericia or 
SPOII and ?hi29 in Bacillus contain their own DMA polymerases, 
and according to the invention one could envision embodiments 
where the aenetic element described above is a bacteriophage 
25 containing an error-prone C.'A polymerase . 

According to the invention the error-prone polymerase is 
typically selected from the group comprising DNA polymerase I or 
reverse transcriptases. A preferred error-prone polymerase is a 
30 variant of E. coli DNA polymerase I or HIV reverse transcriptase. 

As a polypeptide of interest a large number is possible, and 
esoecially such polypeptides exhibiting biological activities 
could* be mentioned. Among these are enzymes, hormones, receptors, 
35 blood-clotting factors, anti -microbial agents, and other such 
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polypeptides important for the prophylaxis and treatment of 
various disorders and diseases in humans and animals. 

Also, enzymes used for industrial purposes could be mentioned. 
5 Among such industrial enzymes, enzymes belonging to the groups 
carbonyl hydrolases, carbohydrases , oxidoreductases, trans- 
ferases, phytases, anti-microbial polypeptides, oxidoreductases, 
isomerases, lyases, and ligases. 

10 In this context the expression "carbonyl hydrolase" means enzymes 
that hydrolyze compounds containing a -C(=0)-X group, where X is 
oxygen or nitrogen. 

Specific classes of enzymes belonging to the group of carbonyl 
is hydrolases are such as hydrolases (lipases) and peptide 
hydrolases (proteases) . 

Proteases are here meant as er.:pe5 classified under the Enzyme 
Classification number E.C. 3.4 in accordance with the 
20 Recommendations (1992) of the International Union of Biochemistry ' 
and Molecular Biology (IU3MB) . 

Examoles include proteases selected from those classified under 
the Enzyme Classification (E.G.) numbers: 

25 

3.4.11 (i.e. so-called aminopepridases) , including 3,4.11.5 
(Prolyl aminopeptidase) , 3.4.11.9 (X-pro aminopept idase) , 
3.4.11.10 (3acterial leucyl aminopeptidase), 3.4.11.12 
(Thermophilic aminopeptidase) , 3.4.11.15 (Lysyl aminopeptidase) , 
30 3.4.11.17 (Tryptophan-/! aminopeptidase), 3.4.11.18 (Methionyl 
aminopeptidase) . 

3.4.21 (i.e. so-called serine endopept idases) , including 3.4.21.1 
(Chymotrypsin) , 3.4.21.4 (Trypsin), 3.4.21.25 (Cucumisin) , 
35 3.4.21.32 (Brachyurin) , 3.4.21.48 (Cerevisin) and 3.4.21.62 
(Subtilisin) ; 
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3.4.22 (i.e. so-called cysteine endopeptidases) , including 
3.4.22.^2 (Papain) , 3.4.22.3 (Ficain) , 3.4.22.6 (Chymopapain) , 
3.4.22.7 (Asclepain) , 3.4.22.14 (Actinidain) , 3.4.22.30 

5 (Caricain) and 3.4.22.31 (Ananain) ; 

3.4.23 (i.e. so-called aspartic endopeptidases), including 
3.4.23.1 (Pepsin A), 3.4.23.18 (Aspergillopepsin I), 3.4.23.20 
(Penicillopepsin) and 3.4.23.25 (Saccharopepsin) ; and 

o 

3.4.24 (i.e. so-called metallo endopeptidases), including 
3.4.24.28 (Bacillolysin) . 

Examples of relevant subtilisins comprise subtilisin BPN 1 , 
5 subtilisin amylosacchari t icus , subtilisin 168, subtilisin 
mesentericopeptidase, subtilisin Carlsberg, subtilisin DY, 
subtilisin 309, subtilisin 147, thermitase, aqualysin, Bacillus 
PB92 protease, proteinase K, Protease 7W7, and Protease TW3 . 

o Specific examples of such readily available commercial proteases 
include Esperase®, Alcalase®, Neutrase®, Dyrazym®, Savinase®, 
Pyrase®, Pancreatic Trypsin NOVO (PTN) , Bio-Feed™ Pro, Clear- 
Lens Pro (all enzym.es available from Novo Nordisk A/S) . 

5 Examples of other commercial proteases include Maxatase®, 
Maxacal®, Maxapem® marketed by Gist -Brocades N.V., Opticlean© 
marketed by Solvay et Cie. and Purafect® marketed by Genencor 
International . 

10 It is to be understood that also protease variants are 
contemplated as the polypeptide of interest. Examples of such 
protease variants are disclosed in EP 130.756 (Genentech) , EP 
214.435 (Henkel), WO 87/04461 (Amgen) , WO 87/05050 (Genex) , EP 
251.446 (Genencor), EP 260.105 (Genencor), Thomas et al . , (1985), 

35 Nature. 318, p. 375-376, Thomas et al., (1987), J. Mol. Biol., 
193, pp. 803-813, Russel et al., (1987), Nature, 328, p. 496-500, 
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WO 88/08028 (Genex) , WO 88/08033 (Amgen) , WO 89/06279 (Novo 
Nordisk A/S) , WO 91/00345 (Novo Nordisk A/S) , EP 525 610 
(Solvay) and WO 94/02618 (Gist-Brocades N.V.). 

5 The activity of proteases can be determined as described in 
"Methods of Enzymatic Analysis", third edition, 1984, Verlag 
Chemie, Weinheim, vol. 5. 

Lipases are here meant as enzymes classified under the Enzyme 
10 Classification number E.C. 3.1.1 (Carboxylic Ester Hydrolases) in 
accordance with the Recommendations (1992) of the International 
Union of Biochemistry and Molecular Biology (IUBMB) . 

Examples include lipases selected from those classified under the 
is Enzyme Classification (E.C.) numbers: 

3.1.1 (i.e. so-called Carboxylic Ester Hydrolases), including 
(3.1.1.3) Triacylglycerol lipases, (3.1.1.4.) Phospholipase A. _ 

20 Examples of lipases include lipases derived from the following 
microorganisms. The indicated patent publications are in- 
corporated herein by reference: 

Hwvicola, e.g. H. brevispora, H. lanuginosa, H. brevis var. 

therrr.oidea and H. insoler.s (US 4,810,414) 

2 5 

Pseudo^onas , e.g. Ps . tragi, Ps . stutzeri, Ps. cepacia and Ps. 
fluorescens (WO 89/04361), or Ps . plantarii or Ps . gladioli (US 
patent no. 4,950,417 (Solvay enzymes)) or Ps . alcaligenes and Ps. 
pseudoalcaligenes (E? 218 272) or Ps. mendocina (WO 88/09367; US 
30 5,389,536) . 

Fusari um, e.g. F. oxysporum (E? 13 0,064) or F. solani pisi (WO 
90/09446) . 

35 Mucor (also called Rhizomucor) , e.g. M. miehei (EP 238 023). 
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Chromobacterium {especially C. viscosum) 
Aspergillus (especially A. niger) . 

5 Candida, e.g. C. cylindracea (also called C. rugosa) or C. 
antarctica (WO 88/02775) or C. antarctica lipase A or B (WO 
94/01541 and WO 89/02916) . 

Geotricum, e.g. G. candidu/n (Schimada et al . , (1989), J . 
o Eioche/n., 106, 383-388) 

Penici22iu/n, e.g. P. ca/nembertii (Yamaguchi et al . , (1991), Gene 
103, 61-67) . 

5 Rhizopus, e.g. R. delemar (Hass e: al., (1991), Gene 109, 107- 
113) or R . niveus (Kugimiya ec al., (1992) Biosci . Biotech. 
Biochem 56, 716-719) or R. oryzae . 

Bacillus, e.g. 3. subtilis (Cartels et al . , (1993) Biochemica et 
o Biophysica acta 1131, 253-260) or 3. stearothermophilus (J? 
64/7744992) or 3. pumilus (WO 91/16422) . 

Specific examples cf readily available commercial lipases include 
Lipoiase®, Lipoiase™ Ultra, Lipozyme©, Palatase®, Novozym® 435, 
5 Lecitase© (all available from Novo Nordisk A/S) . 

Examples of other lipases are Lumafast™, Ps . mendocina lipase 

from Genencor Int. Inc.; Lipomax™, Ps. pseudoalcaligenes lipase 

from Gist Brocades/Genencor Int. Inc.; FusariLun solani lipase 

io (cutinase) from Unilever; Bacillus sp. lipase from Solvay 
enzymes. Other lipases are available from other companies. 
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The activity of -the lipase can be determined as described in 
"Methods of Enzymatic Analysis", Third Edition, 1984, Verlag 
Chemie, Weinhein, vol. 4, or as described in AF 95/5 GB (avail- 
5 able on request from Novo Nordisk A/S) . 

In this context the expression "carbohydrase" means all enzymes 
capable of breaking down carbohydrate chains (e.g. starches) of 
especially five and six member ring structures (i.e. enzymes 

10 classified under the Enzyme Classification number E.C. 3.2 
(glycosidases) in accordance with the Recommendations (1992) of 
the International Union of Biochemistry and Molecular Biology 
(IUBMB) ) . Also included in the group of carbohydrases according 
to the invention are enzymes capable of isomerizing carbohydrates 

is e.g. six member ring structures, such as D-glucose to e.g. five 
member ring structures like D-fructose. 

Examples include carbohydrases selected from those classified 
under the Enzyme Classification (E.C.) numbers: 

20 

a-amylase (3.2.1.1) > p-amylase (3.2.1.2), glucan 1,4-a- 
glucosidase (3.2.1.3), celiulase (3.2.1.4), endo-l,3(4)- p. 
glucanase (3.2.1.6), endo- 1 # 4 -P-xylanase (3.2.1.8), dextranase 

(3.2.1.11), chitinase (3.2.1,14), polygalacturonase (3.2.1.15), 
25 lysozyme (3.2.1.17), P - clucosidase (3.2.1.21), a-galactosidase 

(3.2.1.22), P-galactosidase (3.2.1.23), amyio-1 , 6 -glucosidase 

(3.2.1.33), xylan 1 , 4 - -xylosidase (3.2.1.37), glucan endo-1 , 3-0- 
D-glucosidase (3.2.1.39) , a-dextrin endo-1, 6-^iucosidase 

(3.2.1.41),- sucrose a-glucosidase (3.2.1.48), glucan endo-l,3-a- 
30 glucosidase (3.2.1.59), glucan 1 , 4 ~P-glucosidase (3.2.1.74), 
glucan endo- 1 , 6 -P-glucosidase (3.2.1.75), arabinan endo-l,5-a- 

arabinosidase (3.2.1.99), lactase (3.2.1.108), chitonanase 

(3.2.1.132) and xylose isomerase (5.3.1.5). 

35 Examples of relevant carbohydrases include a-1, 3-glucanases 
derived from Trichoderma harzianum; ct-l, 6-glucanases derived from 
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arabinosidase H.2.I.M. lactase (3.2.I.U.I. chUonanase 
(3.2.1.132) and xylose isomerase (5.3.1.5). 

Examples of relevant carbohydrases include a-1, 3-glucanases 
derived from Trichoderma harzianum; a-1 , 6-glucanases derived from 
a strain of Paecilomyces ; p-glucanases derived from Bacillus 
subtilis; p-glucanases derived from Humicola insolens; p- 
glucanases derived from Aspergillus niger; p-glucanases derived 
from a strain of Trichoderma; p-glucanases derived from a strain 
of OersJcovia xanthineolytica; exo-1, 4-a-D-glucosidases (gluco- 
amylases) derived from Aspergillus -niger; a amylases derived fro* 
Bacillus subtilis; a-amylases derived from Bacillus 

* ■ ■ n amvb^q derived from Bacillus 

amyloliquefaciens; a-amylas-s °- ri 

stearothermophilus; a-amylases derived from Aspergillus oryzae; 
a-amylases derived from non-pathogenic microorganisms; a- 

■ --o-ti 7 lus niqer; Pentosanases , 
galactosidases aenvea rrom ^s.g^ius nig. , 

rBllnhiases c-Uulases, hemi-cellulases derived from 
xylanases, cellooiases, < 

Humicola insolens; cellulases derived from Trichoderma reesei; 
cellulases derived from nonpathogenic mold; pectinases, 
, cellulases, arabinases, hemi -celluloses derived from Aspergillus 

n a v--pn aS os d— <v»d fro~ Penicillium lilacinum; endo- 

niger ; aextranases u 

glucose derived from nor. -pathogenic mold; pullulanases derived 
from Bacillus acidopullyticus; p-galactosidases derived from 

. .„.i ?na c es derived fro™ Trichoderma 

5 reesei; 

r ^-^iv a^^abie commercial carbohydrases 
Specific examoles of readily dUii - 

include Aloha-Gal-. Bio-Feed- Alpha, Bio-Feed™ Beta, Bio-Feed- 
Plus Bio-Feed- Plus, Novozyme© 138, Carezyme®, Celluclast®, 
30 Cellusoft®, Ceremyl®, Citrozym- Denimax-, D«yme™. 
Dextrozyme™, Finizym®, Fungamyl™. Gamanase™, Glucanex®, 
Lactozym®, Maltogenase™, Pentopan™. Pectinex™, Promozyme®, 
Pulpzyme™, Novamyl™, Term.amyl®, AMG (Amyloglucosidase Novo), 
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Maltogenase®, Sweetzyme®, Aquazym® (all enzymes available from 
Novo Nordisk A/S) . Other carbohydrases are available from other 
companies . 

5 It is to be understood that also variants of such carbohydrases 
are contemplated as the polypeptide of interest. 

The activity of carbohydrases can be determined as described in 
"Methods of Enzymatic Analysis", third edition, 1984, Verlag 
10 Chemie, Weinheim, vol. 4. 

Oxidoreductases are here meant to be ■ 'enzymes classified under the 
Enzyme Classification number E.C, 1 (Oxidoreductases) in 
accordance with the Recommendations (1992) of the International ■ 
15 Union of Biochemistry and Molecular Biology (IUBMB) . 

Examples include oxidoreductases selected from those classified 
under the Enzyme Classification (E.C. ) numbers: 

Glycerol-3 -phosphate dehydrogenase {NAD*) (1.1.1.8), Glycerol-3- 

20 phosphate dehydrogenase NAD ( P ) * (1.1.1.94), Glycerol-3 -phosphate 
1 -dehydrogenase (NADP) (1.1.1.94), Glucose oxidase (1.1.3.4), 
Hexose oxidase (1.1.3.5), Catechol oxidase (1.1.3.14), Bilirubin 
oxidase (1.3.3.5), Alanine dehydrogenase (1.4.1.1), Glutamate 
dehydrogenase (1.4.1.2), Glutamate dehydrogenase (NAD (P) *) 

25 (1.4.1.3), Glutamate dehydrogenase (NAD? 4 ) (1.4.1.4), L-Amino 
acid dehydrogenase (1.4.1.5), Serine dehydrogenase (1.4.1.7), 
Valine dehydrogenase (NAD?*) (1.4.1.8), Leucine dehydrogenase 
(1.4.1.9), Glycine dehydrogenase (1.4.1.10), L-Amino -acid oxidase 
(1.4,3.2.), D-Amino-acid oxidase (1 . 4 . 3 . 3) , L-Glutamate oxidase 

30 (1.4.3.11), Protein-lysine 6-oxidase (1.4.3.13), L-iysine oxidase 
(1.4.3.14), L-Aspartate oxidase (1.4.3.16), D-amino-acid 
dehydrogenase (1.4.99.1), Protein disulfide reductase (1.6.4.4), 
Thioredoxin reductase (1.6.4.5), Protein disulfide reductase 
(glutathione) (1.8.4.2), Laccase (1.10.3.2), Catalase (1.11.1.6), 

35 Peroxidase (1.11.1.7), Lipoxygenase (1.13,11.12), Superoxide 
dismutase (1.15.1.1) 
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Said Glucose oxidases may be derived from Aspergillus niger. 

Said Laccases may be derived from Polyporus pinsitus, 
Myceliophtora thermophila, Coprinus cinereus, Rhizoctonia solani, 
5 Rhizoctonia praticola, Scytalidium thermophilum and Rhus 
vernicifera . 

Bilirubin oxidases may be derived from Myrothecheci urn verrucaria. 

10 The Peroxidase may be derived from e.g. Soy bean, Horseradish or 
Coprinus cinereus. 

The Protein Disulfide reductase may be any mentioned in any of 
the DK patent applications no. 76B/93, 265/94 and 264/94 (Novo 
is Nordisk A/S) , which are hereby incorporated as reference, inclu- 
ding Protein Disulfide reductases of bovine origin, Protein 
Disulfide reductases derived fro- Aspergillus oryzae or Asper- 
gillus niger, and DsbA or DsbC derived from Escherichia coli. 

20 Specific examples of readily available commercial oxidoreductases 
include Gluzyme™ (enzyme available from Novo Nordisk A/S) . 
However, other oxidoreductases are available from others. 

It is to be understood that also variants of oxidoreductases are 
25 contemplated as the polypeptide of interest. 

The activity of oxidoreductases can be determined as described in 
"Methods of Enzymatic Analysis", third edition, 1984, Verlag 
Chemie, Weinheim, vol. 3. 

30 

In this context transferases are enzymes classified under the 
Enzyme Classification number E.C. 2 in accordance with the 
Recommendations (1992) of the Ir.t ernational Union of Biochemistry 
and Molecular Biology (IUBM3) . 

35 
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The transferases may be any transferase in the subgroups of 
transrerases : transferases transferring one-carbon groups (E.C. 
2.1); transferases transferring aldehyde or residues (E.C. 2.2); 
acyltransferases (E.C. 2.3); glucosyltransf erases (E.C. 2.4); 
5 transferases transferring alkyl or aryl groups, other that methyl 
groups (E.C. 2.5); transferases transferring nitrogenous groups 
(2.6) . 

In a preferred embodiment the transferease is a transglutaminase 
10 E.C 2.3.2.13 (Protein-glutamine y-glutamyltransf erase) . 

Transglutaminases are enzymes capable of catalysing an acyl 
transfer reaction in which a y-carboxyamide group of a peptide- 
bound glutamine residue is the acyl donor. Primary amino groups 

is in a variety of compounds may function as acyl acceptors with the '■ 
subsequent formation of mono-substituted y-amides of peptide- 
bound glutamic acid. When the epsilon-amino group of a lysine' 
residue in a peptide-chain serves as the acyl acceptor, the ' 
transferases form intramolecular or intermolecular y-glutamyl -e- 

20 lysyl crosslinks. 

Examples of transglutaminases are described in the pending DK 
patent application no. 990/94 (Novo Nordisk A/S) . 

25 The transglutaminase may the of human, aminal (e.g. bovine) or 
microbially origin. 

Examples of such transglutaminases are animal derived 
transglutaminases, FXIIIa; microbial transglutaminases derived 

30 from Physarum polycephalum (Klein et al . , Journal of Bacteriol- 
°9Yt 174, p. 2599-2605) ; transglutaminases derived from Strep- 
tomyces sp . , including Streptcr.yces lavendulae, Streptomyces 
lydicus (former Streptomyces libani) and Streptoverticillium sp., 
including Streptoverticillium mobaraense, Streptoverticillium 

35 cinnamoneum, and Streptoverticillium griseocarneum (Motoki et 
al., US 5,156,956; Andou et al . , US 5,252,469; Kaempfer et al., 
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journal of General Microbiology, 137, 1831-1892; Ochi et al . , 
international Journal of Sytercatic Bacteriology, 44, 285-292; 
Andou et al., US 5,252,469; Williams et al . , Journal of General 
Microbiology, 129, 1743-1813). 

It is to be understood that also transferase variants are 
contemplated as the polypeptide of interest. 

The activity of transglutaminases can be determined as described 
o in "Methods of Enzymatic Analysis" , third edition, 1984, Verlag 
Chemie, Weinheim, vol. 1-10. 

in this context phytases are enzymes classified under the Enzyme 

Classification number E.G. 3.1.3 (Phosphoric Monoester 

j ^ \ Ar, .rmrHAnre w< t" the Recommendations (1992) or the 
5 Hydrolases) m accoraance 

i nr^n n- B^oc^r.istry and Molecular Biology 
International Union o_ a - ot " J ' uS ' 

(IU3MB) . 

ohytas-s are enzvmes produced by microorganisms which catalyse 
o the conversion of phytate to inositol and inorganic phosphorus 

^„^; n - -^-oo^vsi^s comprise bacteria such as 
D hvtase Droaucmg L.iC^j-^-'- 31 '^ 

Bacillus subtilis, Bacillus natto and Pseudoruonas ; yeasts such as 

^v-o-rVc^o- a— funci such as Aspergillus niger, 
Saccharonyces cere/is^e, a..^ ~un^- 

. llMC r,-,,,,,. ;>s-v=-cillus a^ori, ' Aspergillus oryzae, 

, n n r ]lus nidulans, and various other 

Asoergillus terreus or -y-*^^ 

Aspergillus species) . 

Examoles of phytases include phytases selected from those 
30 classified under the Enzyme Classification (E.G.) numbers: 3- 
phytase (3.1.3.8) and 6-phytase (3.1.3.26). 

Th* activity of phytases car. be determined as described in 
"Methods of Enzymatic Analysis", third edition, 1984, Verlag 
35 Chemie, Weinheim, vol. 1-10, or may be measured according to the 
method described in EP-A1-0 420 358, Example 2 A. 
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In the present context an ant i -microbial polypeptides may be any 
polypeptide exhibiting ant i -microbial activities, such as anti- 
fungal, anti-bacterial, and/or anti-insecticidal activity. 

5 

Such polypeptides may also exhibit other activities such as 
enzymatic activity. 

Examples of ant i -microbial polypeptides according to the 
10 invention include: fungicidally active polypeptides derived from 
the mold genus Curvularia described in WO 94/01459 (Novo Nordisk 
A/S) ; anti-bacterial polypeptides.- described in EP 403.458 
(Kabigen AB) ; anti-microbial proteins isolated from the Mirabilis 
seed, described in WO 92/15691 (Imperial Chem Ind. PLC); anti- 
15 bacterial polypeptides isolated from an extract of pig small 
intestine, described in WO 92/22573 (Boman et al . ) ; polypeptide 
with yeast lethal action accumulated by yeast of Hansenula spp. 
as described in JP-60130599; Phytolacca insularis antiviral 
protein, which can be used as ar- anti-microbial described in US 
20 patent no. 5,348,865 (Jin Ro LTD.); bacteriolytic enzymes 
preparations derived from Nocardiopsis dassonvillei described in 
US patent no. 5,354,681 (Novo Industri A/S) . 

Examples of other ant i -microbial polypeptides are maganinin, 
25 protegrin , defensin, pseudcmycin, mutanolysin and N- 
acetylmuramidase . 

The present invention in a further aspect relates to a method for 
generating a DNA sequence encoding a desired variant of a 
30 polypeptide of interest, wherein 

(i) a mutant library is produced by the above method, 

(ii) said library is cultivated under' conditions conducive for 
the expression of said gene of interest to produce variant 
polypeptides, 
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(iii) said variant polypeptides are screened or. selected for a 
desired property, and hosts producing desired variants 
identified and isolated, 

(iv) the DNA encoding said variants is isolated. 

The present invention in a still further aspect relates to a 
method for the determination of a DNA sequence encoding a desired 
variant of a polypeptide of interest, wherein 

(i) a mutant, library is produced as described above, 

(ii) said library is cultivated under conditions conducive for 
the expression of said gene of interest to produce variant 
polypeptides, 

(iii) said variant polypeptides are screened or selected for a 
desired property, and hosts producing desired variants 
identified and isolated, 

(iv) said genetic element in said hosts is sequenced to 
elucidate the DNA sequence of the mutant gene encoding a 
desired variant. 

o This aspect of the invention can be performed by making dilutions 
of the library (e.g. in nicrotiter plates) and culturing these, 
whereby populations are made each originating from one member of 
the library, and the variant polypeptide produced from each of 
Che populations screened for the desired properties. 

5 Alternatively the library might be plated on agar-plates 
containing a desired growth medium that allows for the screening 
of or selection for desired properties of the variant 
polypeptide . 

30 If the phage display method is used, the screening or selection 
is performed directly with the phages. 

The criteria used for the selection will vary according to the 
end use of the polypeptide variant of interest, but properties 
35 typically being tested may include solubility and half-life in 
various media, antigenicity and allergenicity , thermal stability, 
oxidation stability, storage stability, substrate specificity and 
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affinity/ stability to non-aqueous solvents, pH profile, ionic 
strength dependence, catalytic efficiency, and compatibility with 
other components of envisaged end products wherein the 
polypeptide variant will form a part. 

For enzymes to be used in detergents further properties to be 
investigated are, wash performance and compatibility with various 
surfaces, especially fabrics. 

Numerous other criteria could be mentioned. 

Upon identification of populations that produce variant 
polypeptides fulfilling the criteria selected, the DNA encoding, 
the polypeptide variant of interest is isolated and sequenced by-* 
use of methods well known in the art. 

The invention furthermore comprises a process for the production 
of a desired polypeptide variant, wherein 

(i) a DNA sequence determined as indicated above is introduced 
into a suitable host in a manner whereby it can be 
expressed in said host, 

(ii) said host is cultivated under conditions conducive to the 
expression of said DNA sequence, and 

(iii) said polypeptide variant is recovered. 

The present invention can be used with any cell, especially any 
microbial cell, but it is often suitable to use a prokaryote, 
especially a bacterium, preferably of the genus Bacillus , etc. 

Among the Bacilli it is preferred to use a strain chosen from the 
group comprising B. lentus, B. licheniforruis , B . 
amyloliquefaciens, S. subtilis , etc. 

For some uses it is preferable to use a microbial cell which is a 
fungus, especially a filamentous fungus, preferably of the genus 
Aspergi 1 1 us , Tri ohoderma , e t c . 
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Among the Aspergilli it is preferable to use a strain chosen from 
the group comprising A. oryzae, A. nigez, A. awamori, etc. 

s Among the Trichoderma it is preferable to use a strain chosen 
from the group comprising T. reseei, etc. 

In yet other situations it is more expedient to use a mammalian 
cell chosen from the group comprising BKK, etc. cells. 



10 



The invention should not be construed to be limited to specific 
examples or embodiments mentioned 'in the specification above or 
the following examples. 



15 MATERIALS AND METHODS 



EXAMPLES 

20 

EXAMPLE 1 

The system used is an Eschericia coli host cell, which is 

characterized by a number of chromosomal mutations: 

i} a cs (thermoser.sitive) mutation in the polC gene ! encoding 

DMA polymerase III, being the main replication polymerase) 
ii) a mutation in the polA gene (encoding DNA polymerase I) 

causing an increased error rate by a reduction in the 3'- 

5' exonuclease activity. 
30 iii) repair deficiency by the mutL nutation. 

The target for the in vivo mutagenesis is plasmid pBR322 (colEl 
origin) having either (i) a frar.e shift mutation, or (ii) a stop 
codon" introduced into the tet gene, encoding a protein conferring 
35 resistance towards tetracyclic 



WO 97/25410 



PCT/DK97/00014 



29 

In each case the repair of the mutation leads to a dominant 
tetracycline resistant phenotype. 

pBR322 contains also the bla gene conferring resistance towards 
5 ampicillin. The higher mutagenesis frequency at the target region 
is seen as a higher frequency of tetracycline resistant colonies 
after plating a culture exposed to "mutation-introduction" 
conditions . 

10 An E. coli culture grown at 37°C to an optical density of l 
measured at 600 nm is exposed to 2, 4 or 16 hours at restrictive 
temperature, e.g. 42°C. At these time points dilution series of 
the cultures are plated on LBagar supplementet with 
1) ampicillin (AmpR colonies) 

15 2) tetracycline and ampicillin . (AmpR and tetR colonies) 

The ratio of tetracycline resistant colonies to ampicillin 

resistant colonies indicate the number of cells in the culture 

that contains one copy of a repaired tet gene, indicated one 
20 specific mutagenesis event. 

This means, if a clone have become tetracycline resistant, at 
least one specific mutation has occurred to repair the originally 
introduced gene defect. 
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PATENT CLAIMS 



1. A method for in vivo production of a library in cells 

comprising a multitude of mutated genetic elements, wherein an 
s error-prone polymerase is used in each ancestral cell to 
replicate all or a part of a genetic element comprising 

an origin of replication from which replication is 
initiated, 

optionally a genetic marker, e.g. a gene conferring 
resistance towards an antibiotic, 
a gene encoding the polypeptide of interest, 
independently of the host chromosomal replication machinery. 



i) 



in 



2. A method for in vivo production of a library in cells 

5 comprising a multitude of mutated genetic elements comprising 
A) providing a microbial cell having 

i) an error-prone polymerase that independently of the 

chromosomal replication machinery of said cell will 
replicate all or a part of a genetic element 
o comprising 

a) an origin of replication from which replication 
is initiated, 

b) optionally a genetic marker, e.g. a gene 
conferring resistance towards an antibiotic, 

5 c) a cene encoding the polypeptide of interest, 

and 

ii) a chromosomal replication machinery that can be 
reversibly induced to be substantially non- 
functional , 

30 B) growing such a cell under conditions conducive to its 

replication to obtain a multitude of ancestral cells, 
C) reversibly inducing said chromosomal replication machinery 

in said ancestral cells be substantially non- functional 
for a period of time sufficient to allow for the 

35 replication of said genetic element by said error-prone 

polymerase to generate mutations in said genetic element, 
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D) reversibly inducing said chromosomal replication machinery 
in such mutated cells to be substantially functional, and 

E) growing such mutated cells under conditions conducive to 
their replication. 

3. The method of claim 1 or 2, wherein said cell further 
comprises a deficient repair system. 

4. The method of any of the claims 1 to 3, wherein said 

> genetic element is a plasmid, a phagemid, a phage, a virus, a 
retrovirus or a retrotransposon . 

5. The method of any of the claims 1 to 4, wherein said cells 
are microbial cells. 

6. The method of any of the claims 1 to 5 , wherein said 
error-prone polymerase is selected from the group comprising DNA 
pol I; DNA pol II, reverse transcriptase and more specifically 
E.coli DNA pol I , Bacillus subtilis DNA pol I, HIV reverse 

) transcriptase, T4 DNA polymerase, 77 DNA polymerase, Phi29 DNA 
polymerase . 

7. The method of any of the claims 1 to 6 , wherein said 
chromosomal replication machinery that can be reversibly induced 

> to be substantially non- functional is a temperature sensitive E. 
coli DNA polymerase III 

8. A method for the determination of a DNA sequence encoding 
a desired variant of a polypeptide of interest, wherein 

o (i) a mutant library is produced by the method of any of the 
claims 1 to 7 , wherein said genetic element comprises a 
gene encoding said polypeptide of interest, 
(ii) said library is cultivated under conditions conducive for 
the expression of said gene of interest to produce variant 

5 polypeptides, 
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(ii-i) said variant polypeptides are screened or selected for a 

desired property, and hosts producing desired variants 

identified and isolated, 
(iv) said genetic element in said hosts is sequenced to 

elucidate the DNA sequence of the mutant gene encoding a 

desired variant. 

9. A method for generating a DNA sequence encoding a desired 
variant of a polypeptide of interest, wherein 

(i) a mutant library is produced by the method of any of the 
claims 1 to 7, wherein said genetic element comprises a 
gene encoding said polypeptide of interest, 

(ii) said library is cultivated under conditions conducive for 
the expression of said gene of interest to produce variant 
polypeptides , 

(iii) said variant polypeptides are screened or selected for a 
desired property, and hosts producing desired variants 
identified and isolated, 

(iv) the DNA encoding said variants is isolated. 

10. A process for the production of a desired polypeptide 
variant, wherein 

(i) a DNA sequence obtained according to claim 9 or determined 
according to claim 8 is introduced into a suitable host in 
a manner whereby ic can be expressed in said host, 

(ii) said host is cultivated under conditions conducive to the 
expression of said DNA sequence, and 

(iii) said polypeptide variant is recovered, 

11. A method of any of the claims 1 to 10, wherein said 
polypeptide of interest is an enzyme. 

12. A method of claim 11, wherein said enzyme is a carbonyl 
hydrolases , carbohydrases , oxidoreductases , transferases , 
phytases, ligases, lyases, and anti-microbial polypeptides. 



WO 97/25410 



PCT/DK97/00014 



35 

13. - A method of claim 12, wherein said carbonyl hydrolase is a 
protease, or a lipase. 

14. A method of claim 12, wherein said carbohydrase is an 
5 amylase, glucosidase, cellulase, glucanase, xylanase, dextranase, 

chi tinase , polygalacturonase , lysozyme , glucosidase , 

galactosidase, xylosidase, arabinosidase, lactase, chitonanase, 
xylose isomerase, pectin esterase, rhamnogalacturonase, endo- 
glucanase . 

10 

15. A method of claim 12, wherein said oxidoreductase is a 
dehydrogenase, oxidase, reductase, Laccase, Catalase, Peroxidase, 
Lipoxygenase, Superoxide dismutase. 

15 16. A method of claim 12, wherein said transferase is a 
transferase transferring one-carbon groups, a transferase 
transferring aldehyde or residues, acyltransf erase, * 

glucosyltransf erase , transferase transferring alkyl or aryl ■ 
groups, other that methyl groups, transferase transferring 

20 nitrogeneous groups. 

17. A method of any. of the claims 1 to 16, wherein said cell 
is a prokaryote, especially a bacterium, preferably of the genus 
Bacillus, Escherichia, Staphylococcus , and Streptococus . 

25 

18. A method of claim 17, wherein said Escherichia is a strain 
chosen from E. coli. 

19. A method of claim 17, wherein said Bacillus is a strain 
30 chosen from the group comprising B . lentus, B. licheniformis t B. 

amyloliquefaciens, B. subtiiis. 

20. A method of any of the claims 1 to 16, wherein said cell 
is a ' fungus, especially a yeast or a filamentous fungus, 

35 preferably of the genus Aspergillus , Trichoderma. 
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21. ' A method of claim 20, wherein said Aspergillus is chosen 
from the group comprising A. oryzae, A. niger, A. awamori. 

22. A method of claim 20, wherein said Trichoderma is chosen 
from the group comprising T. reseei. 

23. A method of any of the claims 1 to 16, wherein said cell 
is a mammalian cell chosen from the group comprising BHK cells or 
an insect cell. 
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